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Abstract. We show how a commutative monad gives rise to a theory of extensive quantities, includ- 
ing (under suitable further conditions) a differential calculus of such. The relationship to Schwartz 
distributions is dicussed. The paper is a companion to the author's "Monads and extensive quantities", 
but is phrased in more elementary terms. 

Introduction 

Quantities of a given type distributed over a given space (say distributions of smoke in a 
given room) may often be added, and multiplied by real scalars - ideally, they form a real 
vector space. Lawvere stressed that the dependence of such vector spaces on the space 
over which the quantities in question are distributed, should be taken into account; in fact, 
the dependence is functorial. The viewpoint leads to a distinction between two kinds of 
quantities: the functorality may be covariant, or it may be contravariant: In this context, the 
covariant quantity types are called extensive quantities, and the contravariant ones intensive 
quantities. This usage is an attempt to put mathematical precision into the use of these terms 
in classical philosophy of physics. Mass distribution is an extensive quantity; mass density 
is an intensive one. Lawvere observed that extensive and intensive quantites often come 
in pairs, with a definite pattern of mutual relationship, like the homology and cohomology 
functors on the category of topological spaces. 

In lfT4l . we showed how such a pattern essentially comes about, whenever one has a 
commutative monad T on a Cartesian Closed Category S (where S is meant to model some 
category of spaces, not specified further). 

Such a monad is in particular a covariant endo-functor on S, so the emphasis in our 
theory is the covariant aspect: the extensive quantities. We attempt to push a theory of these 
as far as possible, with the intensive quantities in a secondary role. 

This in particular applies to the differential calculus of extensive quantities on the line 
R, which will be discussed in the last Sections[8]and|9l here, we also discuss the relationship 
to the theory of Schwartz distributions of compact support (these have the covariant func- 
torality requested for extensive quantities, and is a basis for a classical version of a theory 
of extensive quantities). 

The article 1 15 1 by Reyes and the author develop some further differential calculus of ex- 
tensive quantities, not only in dimension 1, as here; but it is couched in terms of the Schwartz 
(double dualization) paradigm, which we presently want to push in the background. 
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1 Monads and their algebras 



The relationship between universal algebra, on the one hand, and monadfl on the category 
of sets on the other, became apparent in the mid 60s, through the work of Linton, Manes, 
Kleisli, and many others: 

If r = (T,T],fi) is such a monad, andX is a set, an element P G T(X) may be interpreted 
as an X-ary operation on arbitrary T-algebras B = (fi,j8): if <p : X — > B is an X-tuple of 
elements in B, we can construct a single element (P,<j>) G B, namely the value of 

T(X) -lifL T(B) — - - B 

on the element P G T(X). Then every morphism / : B — > C of T-algebras is a homomor- 
phism with respect to the operation defined by P. There is also a converse statement. 

The monad-theoretic formulation of universal algebra can be lifted to symmetric 
monoidal closed categories § other than sets, provided one considers the monad T to be 
$ -enrichecQ, in particular, it applies to <?-enriched monads on any cartesian closed cate- 
gory S. 

Recall that for any functor T : $ — >■ S, one has maps hom(X,F) — > hom(T(X),T(Y)), 
sending / G hom(X,Y) to T(f) G hom(T(X),T(Y)); the if-enrichment means that these 
maps not only can be defined for the hom-sets, but for the hom-objects, so that we get maps 
(the "strength" of T) 

stx.y : Y x -> T{Y) T{X) ; (1) 

Y x , as an object of S carries more structure than the mere set of maps from X to Y, e.g. it 
may carry some topology, if § happens to be of topological nature. 

This widening of the scope of monad theoretic universal algebra was documented in a 
series of articles by the author in the early 1970s, cf. |]6], (El, ©, 0, iTTOl . In particular, the 
formulation makes sense for cartesian closed categories, which is the context of the present 
note. Via this formulation, it makes contact with functional analysis, because the basic logic 
behind functional analysis is the ability to form function spaces, even non-linear ones. For 
instance, the category S of convenient vector spaces, and the smooth (not necessarily non- 
linear maps) is a cartesian closed category, where several theories of functional analysis have 
natural formulation, e.g. the theory of distributions (in the sense of Schwartz and others). 

We expound here some aspects of the relationship between the theory of strong monads 
and functional analysis, by talking about § as if it were just the category of sets, and where 
(f-enrichment therefore is automatic. (A rigourous account of these aspects for general 
cartesian closed categories S is given in |14|.) So our exposition technique here is in the 
spirit of the "naive" exposition of synthetic differential geometry, as given in fl3l . say. 

One aim of the theory developed in fl4l . and expounded "synthetically" in the present 
note, is to document that the space T(X) may be seen as a space of extensive quantities (of 
some type) on X, in the sense of Lawvere. So we prefer to talk about a P G T(X) as an 
extensive quantity on X, rather than as "an X-ary operation, operating on all T-algebras B". 

'For the notion of monad, and algebras for a monad, the reader may consult |2|. 
2 for these notions, the reader is again referred to Q- 
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To make this work well, one must assume that the monad T is commutativJ^, see Section [2] 
and[3] 

A main emphasis in Lawvere's ideas is that the space of extensive quantities onX depend 
(covariant) functorially on X; in the present context, functorality is encoded by the fact that 
T is a functor. So for / : X — > Y, we have T '(/) : T (X) — > T (Y); when T is well understood 
from the context, we may write /* for T(f); this is a type of notation that is as old as the 
very notion of functor (recall homology!). 

Thus, the semantics of P € T(X), as an X-aiy operation an T-algebras may be 

rendered in terms of a pairing 

(PJ):=Pfa(P)), (2) 

where <p : X — > B is a map ("an X-tuple of elements of B"). This semantic aspect of extensive 
quantities is essential in Section [9] where it is seen as the basis of a synthetic theory of 
Schwartz distributions. 

For completeness, let us indicate how the pairing is defined without using individual 
elements P G T(X) and G B x , as a map 

T(X) xB x 

namely utilizing the assumed enrichment (Q} of T over S : 

, , y T(X) x st , . , , T i Y \ ev , . 6 
r(Z) x B x — i— ^ ► x T(B) T{X > ► T{B) — B; 

here, ev denotes the "evaluation" map (part of the cartesian closed structure of £). Hence- 
forth, we shall be content with using "synthetic" descriptions, utilizing elements. 

We note the following naturality property of the pairing: for / : X — >■ Y, P £ T(X) and 
V/ : Y — > B, we have 

(MP), W ) = (P,f*( V )), (3) 
where f*(y/) := ty°f- For, the left hand side is j3(v/*/*P), and the right hand side is 

but = Y*°f* smce T is a functor. 

Similarly, if B — (B,j3) and (C = (C,y) are /"-algebras, and F : B ->• C is a T- 
homomorphisnfl we have, for P G ^PO and <j) :X — > B that 

F((P,<j)))^(P,Fo(j,)). (4) 

This is an immediate consequence of F o j3 = 70 T(F), the equation expressing that F is a 
T-homomorphism. 

The terminal object of <§ is denoted 1. The object T(l) plays a special role as the 
algebra of "scalars", and we denote it also R; with suitable properties of T, it will in fact 

3 It should be stressed that to be commutative is a property of enriched (=strong) monads, and enrichment is a 
structure, not a property. However, for the case where £ is the category of sets, enrichment is automatic. 
4 later in this article, r-homomorphisms will be called "T -linear" maps. 
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carry a canonical commutative ring structure, see Section [7] Its multiplicative unit 1 is the 
element picked out by 771. For any X € S, we have a unique map X — >■ 1, denoted ! (when 
X is understood from the context). For P 6 T(X), we have canonically associated a scalar 
tot(P) e r(l), the fofa/ of P, namely 

tot(P) :=!*(P). 

From uniqueness of maps to 1 follows immediately that P and /* (P) have the same total, 
for any / : X — > Y. 

One example of a monad P (on a suitable cartesian closed category of smooth spaces) 
is where T(X) is the space of Schwartz distributions of compact support, we return to 
Schwartz distributions in [9] and they are only mentioned here as a warning, namely that 
functorality is a strong requirement; thus for instance, a uniform distribution on the line can 
never have a total. (The functorial properties of non-compact distributions are not under- 
stood well enough presently.) In (TJ, the authors construct the free real vector space monad 
T, in a category of suitable "convenient" spaces ([16Q, by carving it, out by topological 
means, from the monad of (compactly supported) Schwartz distributions. 

The units r\x : X — > T(X) will, in terms of Schwartz distribution theory, pick out the 
Dirac distributions 8 X ; therefore, we shall allows ourselves the following doubling of nota- 
tion: for x G X, we write 

rj x (x) = 8 X 

(with X understood from the context, on the right hand side). 
Proposition 1 Let B = (B, j3) be a T -algebra, and (j) : X — > B a map. Then 

(8x,4>) = ${x). 

In particular, 

(8 x ,rix) = 8 X . 

Proof. In elementfree terms, the first equation says that the composite 

X ??* > T(X) T(B) 13 . B 

equals (j) :X — > B. And this holds, because by naturality of tj, T(<j>) o rjx = T]b 0; but j3 o r)s 
is the identity map on B, by the unitary law for the algebra structure j3 . The second equation 
is then immediate. 

An extensive quantity of the form 8 X has total 1 6 P(l), 

101(5,) = 1. (5) 

For, the composite map 

m T(l) 
X ^ T{X) U T(1)=R 
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equals the composite 

X * 1 !lT(l)=R, (6) 

by naturality of tj w.r.to ! : X — > 1, and this is the map taking value 1 for all ieX. 

Proposition 2 For the total ofP &T(X), we have 

tot(P) = (P,l z ), 
where lx denotes the function X —> R with constant value 1 S R. 

Proof. The map lx is displayed in (0 above, so (P, lx) is by definition the result of applying 
to P £ T(X) the composite 

T (x) T ^ > r(i) r ^' » r 2 (i) — r(i) =/?. 

Now by one of the unit laws for a monad, the composite of the two last maps here is the 
identity map of T(l), so the displayed composite is just T(\); this is the map which to P 
returns the total of P. 

Notation. We attempted to make the notation as standard as possible. On three occasions, 
this forces us to have double notation, like the rj-8 doubling above, and later E ("expecta- 
tion") for pi, and an "integral" symbol for the pairing of extensive and intensive quantities. 
An exception to standard notation is that the exponential object B x is denoted X rh B, to 
keep it online. (Other online notations have also been used, like [X,B] orl B.) 

2 Tensor product of extensive quantities 

Let T be an algebraic theory, and let P and Q be an X-ary and a T-ary operation of it. Then 
one can define, semantically, an X x T-ary operation P®Q: given an X x T-tuple 9 on the 
T-algebra B; we think of 9 as an matrix of elements of B with X rows and Y columns. Now 
evaluate P on each of the Y columns; this gives a T-tuple of elements of B; then evaluate Q 
on this F-tuple; this gives an element of B. This element is declared to be the value of the 
X x T-ary operation P(g> Q on 9. 

One might instead first have evaluated Q on each of the rows, and then evaluated P on 
the resulting X-tuple; this would in general give a different result, denoted P®Q; the theory 
is called commutative if (g) and <§> agree. 

This kind of tensor product was formulated monad theoretically, and without reference 
to the semantics involving T-algebras, in the author's 1970-1972 papers, so as to be appli- 
cable for any strong monad on any cartesian closed category S\ it takes the form of two 
maps^l natural in X and Y G S , 

% 

T(X)xT(Y) z=t T(XxY); 

(§> 

5 denoted in |6| by ijfx.r and tyx.Y, respetively 
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the monad is called commutative if these two maps agree, for all X and Y. Both ® and (g) 
make the functor T a monoidal functor, in particular they satisfy a well known associativity 
constraint: (P®Q)®S — P®(Q®S), modulo the isomorphisms induced by (X x Y) x Z = 
Xx(Fx Z), and similarly for ®. (The "nullary" part that goes along with (g>, is a map 

Among the equations satsified is ® o {r\ x x rjy) = T]xxr, which in the notation with 8 
reads: for ieX and y g T, 

5»«>^ = 5 M . (7) 

If A = (A, a) and C = (C, 7) are T-algebras, it makes sense to ask whether a map A x 
X — > C is a T-homomorphism in the first variable, cf. [9 1; and similarly it makes sense to 
ask whether a map X x A — > C is a T-homomorphism in the second variable. We shall use 
the term "7 '-linear map" as synonymous with T-homomorphism; this allows us to use the 
term T -bilinear for a map A x B — > C which is a T-homomorphism in each of the two input 
variables separately (where A,B, and C are T-algebras), and similarly "T-linear in the first 
variable", etc. 

If / : X x Y — > C is any map into a T-algebra C, it extends uniquely over r\x x Y to a 
map x F — > C which is T-linear in the first variable. Similarly / extends uniquely 
over X x rjy to a map X x r(T) — > C which is T-linear in the second variable. However, a 
map X xY does not necessarily extend to a r-bilinear T(X) x T(Y) — > C; for this, one 
needs commutativity of T: 

3 Commutative monads 

We henceforth consider a commutative monad T = (T, 17, /i) on S . The reader may have 
for instance the free-abelian-group monad in mind. 

From [9 1, we know that commutativity of T is equivalent to the assertion that jg) : T(X) X 
r(T) — T(X x T) is r-bilinear, for all X and Y. (In the non-commutative case, ® will only 
be T-linear in the second variable, and <£> will only be T-linear in the first variable.) 

Then if C = (C, 7) is a T-algebra, any map / : X x Y — > C extends uniquely over rjx x I7r 
to a r-bilinear map T(X) x T(Y) —> C. Since / also extends uniquely over rjxxY to a 
T-linear T(X xY) C, one may deduce that <g> : r(X) x T(Y) T(X x Y) is in fact a 
universal r-bilinear map out of T(X) x T(Y). 

If Z? = (B,/3) is a T-algebra, and X is an arbitrary object, X rtl B carries a canonical 
"pointwise" T-algebra structure inherited from /3 . In the category of sets, this is just the 
"coordinatwise" T-algebra structure on YlxB. 

Let (A, a) and (B,j3) be T-algebras. If S has sufficiently many equalizers, there is a 
subobject A rhr B of A ftl B, which in the set case consists of those maps A — > B which 
happen to be T-linear. With T commutative, the subobject A rhr B C A rh B is in fact a 
sub-F-algebra, cf. jSJ. 

If A = (A, a), B = (B,j3), and C = (C, 7) are T-algebras, a map A x B — > C is r-bilinear 
iff its transpose A — > B rtl C is T-linear, and factors through the subalgebra B rtlr C. 
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In JT4 1, Theorem 1, we prove that for a commutative monad T, the exponential adjoint 
of the pairing T(X) x (X rtl B) —> B is a r -bilinear map T(X) — > (X rh B) rhr 5; so therefore 
also, we have 

Theorem 1 77ze pairing T(X) x (X &\ B) - — - — — B is T -bilinear, for any 7 '-algebra B. 

4 Convolution 

If a : X x Y -)• Z is any map, and P e g e T(Y), we may form a*(P ® Q) 6 T(Z), 

called the convolution of P and Q along a, and denoted P * a Q. Since <E> is T-bilinear and 
a* = T(a) is T-linear, it follows that P* a Q depends in a T-bilinear way on P, Q. In diagram, 
convolution along a is the composite 

t(x) x t(y) -S- r(x x y) r(z). 

The convolution along the unique map 1 x 1 > 1 gives a multiplication on = T(l), which 
is commutative. 

A consequence of the naturality of eg) w.r.to the maps ! : X — > 1 and ! : Y —> 1 is that 

tot(p®e)=tot(p)-tot(e), (8) 

where the dot denotes the product in R = T(l). Note that this product is itself (modulo the 
identification T(l X 1) = T(l)) a tensor product, T{1) x T{1) ->• T(l x 1) = T(l). 
We have, for i£X and y <EY 

8 X * a 5y = 8 a ^, y ) ■ (9) 
This follows from O, together with naturality of 77 w.r.to a, which in 5-terms reads 

T ( a )($(x.y)) = 8 a (x,y)- 

If a is an associative operation X x X — >• X, it follows from properties of monoidal functors 
that the convolution along a, T(X) x T(X) — s> T(X) is likewise associative. If a is commu- 
tative, commutativity of convolution along a will be a consequence, but here, one uses the 
assumption that the monad T is commutative. 

5 The space of scalars R := T(l) 

The space T(l) plays the role of the "ring" of scalars, or number line. It has a T-linear 
structure, since it is a T-algebra, and it carries a T-bilinear multiplication m, namely 

r(i)xT(i) -S- r(ixi)~r(i), 
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which is commutative and associative. The multiplicative unit is picked out by 771 : 1 — ► 
T(l) = R. - Also R acts on any T(X), by 

T(l) x T(X) — - T(l xl) = (10) 

generalizing the description of multiplication on /?; it is denoted just by a dot •. This action is 
likewise r -bilinear, and unitary and associative. These assertions follow from T-bilinearity 
of Cg), and the compatibility of <E> with rj. 

There are properties of T which will imply that T-algebras carry abelian group structure; 
in this case, R is a commutative ring, with the above m as multiplication, and any T(X) is 
an /?-module, with T-linear maps being in particular /?-linear, see Section|7] 



6 Intensive quantities, and their action on extensive quan- 
tities 

The space R — T(l) is a T-algebra, with a commutative r -bilinear commutative monoid 
structure (so in the additive case, Section [7] it is in particular a commutative ring). From 
general principles^ follows that for any X, the space X (hR=X rfl T(l) inherits a T-algebra 
structure and a r -bilinear monoid structure; in fact — rtl R is a contravariant functor with 
values in the category of monoids whose multiplication is r -bilinear. If / : X — > Y , the map 
/ ftl /? : F rtl /? — }X (t\R preserves this structure. The map / rtl R is denoted /*, and is a kind 
of companion to the covariant : T(X) — > T(Y). In the terminology of Lawvere [18], X rtl/? 
is a space of intensive quantities on X. Note that T only enters in the form of R — T(l). 

The monoid X rtl R =X rtl T(l) acts on any space of the form X rtl T(Y), by a simple 
"pointwise" lifting of the action of T(l) on T(Y), described in dTOb . (with Y instead of X): 

{X rtl r(l)} x {X rtl T(Y)}=X(h {T(l) x T(Y)} - - ^ 1,y I X rh T(l x Y) ^ X it T(Y). 

(The monoid structure on R — T(l) is a speial case.) 

We shall describe an action of the monoid X rtl R on T{X). It has a special case the 
"multiplication of a distribution by a function" known from (Schwartz) distribution theory^. 
Notationally, we let the action be from the right, and denote it h, 

T(X) x(Xrf\R) — ► T(X). 

It is T-linear in the first variable^: it is the 1-T-linear extension over rixxR of a certain 
map X x (X rtl /?) — >• T(X); in other words, we describe first P h for the case where 

6 it is an aspect of the strength of the monad T that the category <g T of T-algebras is <?-enriched; it is even 
cotensored (cf. [ 2,]) over S\ and then X rtl B is the cotensor of the space X with the T-algebra B, for any T-algebra 
B. This then in particular applies to B = T(l). For details, see e.g. 1141 . 

7 also, it is analogous to the cap-product action of the cohomology ring (cup product) on homology, cf. e.g. j4] 

8 it is actually T-bilinear. 
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P = 77 (x) = 8 X for some x 6 X, and where <j> : X — > R is any function. Namely, we put 

8 X h := <p(x)-8 x , 

recalling that r(X) carries a (left) action • by 7? = T(l) (of course "left" and "right" does 
not make any difference here, since the monoids in question are commutative). 

Since T-linearity implies homogeneity w.r.to multiplication by scalars X in R, we have 
in particular that 

A • (S* h 0) = (A • St) h (11) 

We shall prove that the action h is unitary and associative. The unit of X rtl R is lx, i.e. the 
function with constant value 1 € R. So we should prove P h lx = P. By T-linearity in the 
first variable, it is enough to see it when the input P from T(X) is of the form 8 X , so we 
should prove, for the unitary property, 

8 X h l x = St 

and similarly, for the associative property, it suffices to prove 

(S*h4>)hyA=S,h(0-yA) 

for (j) and y arbitrary functions X — >• R. The first equation then is a consequence of lx (x) = 
1; unravelling similarly the second equation, one sees that the two sides are, respectively 
(<j>(x) ■ y(x)) ■ 8 X (using (fTTTi). and (<p ■ y/)(x), and their equality is a consequence of the 
pointwise nature (and commutativity) of the multiplication in R. 

The following result serves in the Schwartz theory in essence as the definition of the 
action h of intensive quantities on extensive ones. Recall that the multiplicative monoid of 
R acts on any T-algebra of the form T(Y), via ® 1>y : T(l) x T(Y) -» T(l xY) = T(Y). The 
action is temporarily denoted H; for Y = 1, it is just the multiplication on R. This action 
extends pointwise to an action H of X rh R on X rtl T(T). 

Proposition 3 Let <j) be a function X — y R, and let \j/ be a function X — > T(Y). Then for any 

p e t(x), 

(P^<j),y) = {P,(l)-\y)GT{Y). 

Proof. Since both sides of the claimed equation depend in a T-linear way on P, it suffices 
to prove the equation for the case where P is 8 X for some x G X. We calculate the left hand 
side: 

(8 x \-<j>,Y) = (<j>(x) ■ 8 X , y/} = <j)(x) ■ (8 X , yr) =<j>{x)- y(x), 
using Proposition[T] and the right hand side similarly calculates 

(St,0 H y) = {(j> H yr)(x), 

which is likewise <p(x) H y/(x), because of the pointwise character of the action H on X rtl 
T(Y). 
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Corollary 1 The pairing (P, <j)) (for P S T(X) and <j) <EX ft\ R) can be described in terms of 
h as follows: 

(P,^)=tot(Ph^). 
Proof. Take Y = 1 (so T{Y) = R), and take y/ = l x . Then 

using (0, and then the Proposition. But (j)-lx = <j>. 

7 Additive structure 

We shall in the present Section describe a simple categorical property of the monad T, which 
will guarantee that 'T-linearity implies additivity", even "7?-linearity" in the sense of a rig 
R <E S ("rig"= commutative semiring), namely R = T(l). This condition will in fact imply 
that <§ T is an additive (or linear) category. 

We begin with some standard general category theory, namely a monad T = (T, rj , /J.) on 
a category which has finite products and finite coproducts. (No distributivity is assumed.) 
So S has an initial object 0. If T(%) G § is a terminal object, then the object (T (0) , is 
a zero object in $ , i.e. it is both initial and terminal. It is initial because T, as a functor 
S — > S 1 , is a left adjoint, hence preserves initials; and since T(%) = 1, it is also terminal (the 
terminal object in <8 T being 1 6 § , equipped with the unique map T(l) — > 1 as structure). 
This zero object in <S T we denote 0. Existence of a zero object in a category implies that 
the category has distinguished zero maps 0a,b : A — > B between any two objects A and B, 
namely the unique map A — > B which factors through 0. For S 1 , we can even talk about 
the zero map x ,b '■ X ->• B, where X and B = (B,j8) 6 S 1 ', namely x ,b is Tf]x followed 
by the zero map Qj(x),b '■ T(X) — > B. We have a canonical map X + Y — >• T(X) x T(T): the 
composite X — > X + Y ^ T (X) x T (Y) is (t]x,0x,t{y)) (here, the first map is the coproduct 
inclusion map ). Similarly, we have a canonical map Y —5- T(X) x T(Y). Using the universal 
property of coproducts, we thus get a canonical map (j) X y ■ X + Y — >• T(X) x T(Y). It extends 
uniquely over r\x+Y :X + Y— > T(X + Y) to a T-linear map 

<P x ,y--T(X+Y)^T(X)xT{Y), 

and <I> is natural in X and in Y. We say that T :§ —> S T takes binary coproducts to products 
if <&x,y is an isomorphism (in S or equivalently in t§ T ) for all X, Y in $ . Note that the 
definition presupposed that T(0) = 1; it is the zero object in S 1 , so that if T takes binary 
coproducts to products, it in fact takes finite coproducts to products, in a similar sense. So 
we can also use the phrase "T takes finite coproducts to products" for this property of T. 

We define an "addition" map in S T ; it is a map + : T(X) x T(X) to T(X), namely the 
composite 

®xy TCV) 
T(X) x T(X) '-* T(X+X) K T(X) 
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where V : X + X — > X is the codiagonal. So in particular, if /n, denotes the /th inclusion 
(i = 1,2) ofX intoX+X, we have 

= [rx — ^ r(x +z) — i rx x rx — - tx] . (12) 

Note that this addition map is T-linear. Under the identification T(X) = T(X + 0) = x 
1, the equation ( TT2T > can also be read: r(!) : T(0) — > is right unit for +, and similarly 
one gets that it is a left unit. 

We leave to the reader the easy proof of associativity and commutativity of the map 
+ : T(X) x T(X) — > T(X). It follows that T(X) acquires structure of an abelian monoid in 
S T (and also in S). 

For an abelian monoid A in any category, we may ask whether A is an abelian group or 
not (so there is a "minus" corresponding to the +); existence of such "minus" is a property 
of A, not an added structure. If T is a monad which takes finite coproducts to products, it 
makes sense to ask whether the canonical monoid structure which T-algebras in this case 
have, is actually an abelian group structure; it is therefore a property on such T, not an added 
structure. We shall henceforth assume this property, since we need "minus" (= difference) 
for differential calculus. 

In |fl4|, we proved that 

Proposition 4 Every T -linear map T(X) — » T(Y) is compatible with the abelian group 
structure. 

We again assume that T is a commutative monad. Recall that we then have the T- 
bilinear action T(X) x T(l) — > T(X). It follows from the Proposition that it is additive in 
each variable separately. 

We have in particular the r-bilinear commutative multiplication m : T(l) x T(l) — > 
T(l), likewise bi-additive, m(x + y,z) — m(x,z) +m(y,z), or in the notation one also wants 
to use, 

(x+y) -z = x-z+y-z, 
so that R = T(l) carries structure of a commutative ring. We may summarize: 

Proposition 5 Each T(X) is a module over the ring R = T(l); each T -linear map T(X) — > 
T(Y) is an R-module morphism. 

It is more generally true that T-linear maps A — > B (for A and B £ S T ) are /^-module maps. 
We shall not use this fact. 

The property of T that it "takes finite coproducts to products" accounts for a limited 
aspect of contravariance for extensive quantities: if a space X is a coproduct, X =X\ +X2, 
the isomorphism T(X\ +X2) = T(X\) x T(Jii) implies that an extensive quantity P G T{X) 
gives rise to a pair of extensive quantities Pi G T{X\) and P% £ TiXi), which one may 
reasonably may call the restrictions of P to X\ and Xi, respectively. Now, restriction is a 
contravariant construction, and applies as such, for intensive quantities, along any map. For 
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extensive quantities, and for monads T of the special kind studied here, it applies only to 
quite special maps, namely to the inclusion maps into finite coproducts, like X\ — > X\ +X2- 

We leave to the reader to philosophize over the extent to which, given a distribution of 
smoke in a given room, it makes sense to talk about "the distribution of this quantity of 
smoke, restricted to the lower half of the room". 

(For distributions in the Schwartz sense, one may construct some further "restriction" 
constructions (restriction to open subsets); this is an aspect of the fact that the corresponding 
intensive quantities (= smooth functions) admit an "extension" construction from closed 
subsets.) 

For u £R,we have the translation map a" ; R — >R given by x i— > x + u. If P G T(R), we 
have thus also eT(R). 

We have the following reformulation of the translation maps in terms of convolution 
along the addition map + : RxR—t R: 

Proposition 6 For any P GT(R) and a GR, 

a?(P) = 8 a *P = P*8 a . 

Proof. The second equation follows from commutativity of +. To see the first equation, we 
observe that both sides of a"{P) = 8 a *P depend T-linearly on P; so it suffices to prove this 
equation for the case where P is of the form 5^ for b G R. But 05°(5^) = 8 a+ b = 8j, +a . 

In particular, we see that 80 is a neutral element for convolution, 80* P = P * 8q = P. 

8 Differential calculus of extensive quantities on R 

We attempt in this Section to show how some differential calculus of extensive quantities G 
T(R) may be developed on equal footing with the standard differential calculus of intensive 
quantities (meaning here: functions defined on R). For this, we assume that the monad T 
on S has the properties described in Section|7] so in particular, R is a commutative ring. To 
have some differential calculus going for such R, one needs some further assumption. 

Consider a commutative ring R. Assume D C R is a subset satisfying the following 
"KL"-axiom: 

for any f : R — > R, there exists a unique /':/?—> R such that for all x G R 

f(x+d)=f(x)+d-f'(x) foralldeD. (13) 

Example: 1) models of synthetic differential geometry, with D the set of d G R with 
d 2 = (the simple "Kock-Lawvere" axiom says (cf. e.g. ifTTI ') a little more than this, namely 
it also asks that any function / : D — » R extends to a function /:/?—> R.) 

2) Any commutative ring, with D = {d} for one single invertible d G R- In this case, for 
given /, the /' asserted by the axiom is the function 

f(x) = ^-(f(x + d)-f(x)), 
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the standard difference quotient. 

Similarly, if V is an /^-module, we say that it satisfies KL, if for any / : R — > V, there 
exists a unique /':/?—> V such that ( TT31 holds for all x£R. 

In either case, we may call /' the derivative of /. 

It is easy to see that any commutative ring R is a model, using {d} as D, as in Example 2) 
(and then also, any /^-module V satisfies then the axiom); this leads to some calculus of finite 
differences. Also, it is true that if £ is the category of abstract sets, there are no non-trivial 
models of the type in Example 1); but, on the other hand, there are other cartesian closed 
categories S (e.g. certain toposes containing the category of smooth manifolds, cf. e.g. ifTTl ). 
and where a rather full fledged differential calculus for intensive quantities emerges from 
the KL-axiom. 

We assume that R = T(l) satisfies the KL-axiom (for some fixed D C R), and also that 
any /^-module of the form T(X) does so. 

Proposition 7 (Cancelling universally quantified ds) IfV is an R-module which satisfies 
KL, and v £ V has the property that d ■ V = for all d £ D, then v = 0. 

Proof. Consider the function / : R — > V given by t h->- t ■ v. Then for all x £ R and d £ D 

f(x + d) = (x + d) ■ v = x ■ v + d ■ v, 

so that the constant function with value v will serve as /'. On the other hand, d ■ v = d ■ by 
assumption, so that the equation may be continued, 

= x- v + d-0 

so that the constant function with value G V will likewise serve as /'. From the uniqueness 
of /', as requested by the axiom, then follows that v = 0. 

We are now going to provide a notion of derivative P' for any P £ T(R). Unlike differen- 
tiation of distributions in the sense of Schwartz, which is defined in terms of differentiation 
of test functions (j), our construction does not mention test functions, and the Schwartz def- 
inition (P 1 , 0) := — (P, (j)'} comes in our treatment out as a result, see Proposition[T3lbelow. 

For u — 0, P— a"(P) = £ T(R). Assuming that the /^-module T(R) is KL, we therefore 
have for any P £ T(R) that there exists a unique P' £ T(R) such that for all d £ D, 

d-P' =P-ai(P). 

Since d ■ P' has total for all d £ D, it follows that P' has total 0. 
Differentiation is translation-invariant: using 

a' o a s = a t+s = a s o a' , 

it is easy to deduce that 

(ai(P))' = (a%(P'). (14) 
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Proposition 8 Differentiation of extensive quantities on R is a T -linear process. 

Proof. Let temporarily A : T(R) — > T(R) denote the differentiation process. Consider a 
fixed d eD. Then for any P e T(R), d-A(P) = d-P' is P-afP; it is a difference of the two 
T-linear maps, namely the identity map on T(R) and a c J = T(a d ), and as such is T-linear. 
Thus for each d ED, the map d ■ A : T(R) — > T(R) is T-linear. Now to prove T-linearity of 
A means, by monad theory, to prove equality of two maps T 2 (R) — y T(R); and since d - A 
is r-linear,as we proved, it follows that the two desired maps T 2 (R) —> T(R) become equal 
when post-composed with the map "multiplication by d": T(R) — > T(R). Since d E D was 
arbitrary, it follows from KL axiom for the /^-module T(X) that the two desired maps are 
equal, proving T -linearity. 

The structure map T(R) -> R of the J-algebra R = T(l) is /ij : T 2 (l) -> T(l). Just 
as T) plays a special role, with J](x) being the Dirac distribution 8 X , the structure maps 
for T-algebras play a role that sometimes deserves an alternative notation and name; thus 
in particular /Xi : T(R) — > R plays in the context of probability distributions the role of 
expectation, see [14|, and we shall here again allow ourselves a doubling of notation and 
terminology: 

E(P):=m(P), 

the expectation of P € T(R). It is a scalar £ R. 
Note that for a e R, 

E{8 c ,)=a; (15) 

since 8 a is rjff(a) = T]ym(a), and E = /ij, this is a consequence of the monad law that 
fix o T]t(x) is the identity map of T(X) for any X, in particular for X = 1. 

Proposition 9 LetPeT(R). Then 

E(P') = -tot(P). 

Proof. The Proposition say that two maps T(R) —> R agree, namely £oA and — tot, where 
A, as above, is the differentiation process Pt-^P'. Both these maps are T-linear, so it suffices 
to prove that the equation holds for the case P = 8 X , so we should prove 

E(8^) = -tot(8 x ). 

By the principle of cancelling universally quantified ds (Proposition 17), it suffices to prove 
that for all d G D that 

d-E{8' x ) = -d-tot(8 x ). 
The right hand side is — d, by ©. The left hand side is 

E(d- 8 X ) 

= E(S x -S x+d ) 

= E(8 x )-E(S x+d ) =x-(x + d) = -d, 
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by (15[ . This proves the Proposition. 

The differentiation process for functions, as a map R itl V — > R ftl V, is likewise T-linear, 
but this important information cannot be used in the same way as we used T-linearity of the 
differentiation T(R) — > T(R), since, unlike T(R), R&iV (not even R rti R) is not known to be 
freely generated by elementary quantities like the 8 x s. 

Recall that if F : V — > W is an /^-linear map between KL modules 

F o <j)' = (F o <j))' (16) 

for any : R — > V. 

One can generalize the differentiation of extensive quantities on R to a differentiation of 
extensive quantities on any space X equipped with a vector field. The case made explicit is 
where the vector field is (x, d) n> x + d (or J^) on R. 

Proposition 10 Let P G T(R) andQe T(R). Then 

(P*Q)' =P'*Q = P*Q'. 

Proof. By commutativity of convolution, it suffices to prove that (P * Q) 1 = P' *Q. Both 
sides depend in a r -bilinear way on P and Q, so it suffices to see the validity for the case 
where P = S a and Q = 8/,. To prove (8 a * 8b)' = 8' a * 8b, it suffices to prove that for all d G D, 

d-(8 a *8 h y = d-8' a *8 b , 

and both sides comes out as 8 a+ i, — 8 a+ t,+ c h using that * is /^-bilinear. 

Primitives of extensive quantities 

We noted already in Section Q] that P and f*(P) have same total, for any P G T(X) and 
/ : X ->• Y . In particular, for P G T (R) and d G D, d ■ P' = P - a c J (P) has total 0, so cancelling 
the universally quantified d we get that P' has total 0. 

A primitive of an extensive quantity Q G T(R) is a P G T(R) with P' = Q. Since any P' 
has total 0, a necessary condition that an extensive quantity Q G T(R) has a primitive is that 
tot(2) = 0. Recall that primitives, in ordinary 1 -variable calculus, are also called "indefinite 
integrals", whence the following use of the word "integration": 

Integration Axiom. Every Q G T(R) with tot(Q) = has a unique primitive. 

(For contrast: for intensive quantities § on R (so (j) : R — > R is a function), the standard 
integration axiom is that primitives always exist, but are not unique, only up to an additive 
constant.) 

By /^-linearity of the differentiation process T(R) — > T(R), the uniqueness assertion in 
the Axiom is equivalent to the assertion: ifP' — 0, then P = 0. (Note that P' = implies that 
P is invariant under translations tt^(P) = P for all d G D.) The reasonableness of this latter 
assertion is a two-stage argument: 1) if P' = 0, P is invariant under arbitary translations 
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0%{P) = P- 2) if P is invariant under all translations, and has compact support, it must be 0. 
(Implicitly here is: R itself is not compact.) 

In standard distribution theory, the Dirac distribution 8 a (where a £ R) has a primitive, 
namely the Heaviside "function"; but this "function" has not compact support - its support 
is a half line C R. 

On the other hand, the integration axiom provides a (unique) primitive for a distribution 
of the form 8 a — 8[,, with a and b in R. This primitive is denoted [a,b], the "interval" from a 
to b; thus, the defining equation for this interval is 

[a,b]' = 8 a -8 b . 

Note that the phrase "interval from ... to ..." does not imply that we are considering an 
ordering < on R (although ultimately, one wants to do so). 

Proposition 11 The total of [a,b] is b — a. 

Proof. We have 

tot(Ml) = = ~ E ($ a - S b ) 

by Proposition|9]and the fact that [a,b] is a primitive of 8 a — 8y 

= -E{S a )+E(8 b )=b-a, 

by®. 

It is of some interest to study the sequence of extensive quantities 

[—a, a], [— a, a] * [— a, a], [— a, a] * [— a, a] * [— a, a], 

they have totals 2a, (2a) 2 , (2a) 3 , . . .; in particular, if 2a = 1, this is a sequence of probability 
distributions, approaching a Gauss normal distribution (the latter, however, has presently no 
place in our context, since it does not have compact support). 

9 Extensive quantities and Schwartz distributions 

Recall from (O that P GT(X) gives rise to an X-ary operation on any T-algebraB = (B, j3), 
via (P,<j>) '■ = /3(0*(P)), where <p 6 X rti B. The pairing is thus a map 

T(X)x{X(hB) > B. 

which is T-bilinear (cf. TheoremQ] or 11141 ). We may take the exponential transpose of the 
pairing; this is then a map Tx : T(X) — > (X rhS) &it B. 

The synthetic rendering of Schwartz distribution theory is that (X rtl R) rtlr R "is" the 
space of Schwartz distributions of compact support: X rtl R is the "space of test functions" 
(not necessarily of compact support), and (X rtl R) rtlr R is the space of T-linear functionals 
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X rh R — > R on the space of such test functions. (In some well adapted models S of SDG, 
and for suitable T, this can be proved to be an object whose set of global sections is in fact 
the standard Schwartz distributions of compact support on X, if X is a smooth manifold; cf. 
lfl9l Proposition II. 3. 6 (Theorem of Que and Reyes).) 

Remark. Consider a commutative ring object in a sufficiently cocomplete cartesian closed 
category S '. Let T be the (strong) monad which to X associates the free /^-module on X. 
Thus in particular/? = T(l). The monad X H> (X rh R) (tir R is in general not a commutative 
monad, so cannot agree with T, although in some cases, the monad map T : T — > (— fhi?) (tir 
R is monic. In flj, it is proved that T(X) for a special case (convenient vector spaces) can 
be "carved out" of (X rh R) fhj R by topological means. Other investigations, e.g. in fl2l . 
and a Theorem of Waelbroeck, describe a class of spaces X which "perceive" Tx to be an 
isomorphism. 

To make contact with classical theory and intuition, we introduce, for the third time, 
a doubling of notation (this one is actually quite classical); for P G T(X) and €X (tlB 
(where B is a T algebra), we write 

/ (j)(x) dP{x) := (P,<j))eB, 
Jx 

with x a dummy variable ranging over X. Thus Proposition|2]may be rendered 

tot(f) = / 1 dP(x). 
Jx 

For B = 7/(1) =RandPeT(R) = T 2 (l), the /-notation will help to motivate the use of the 
terminology "expectation of P" for fli{P)', let <p : R — >• R be the identity map. Then 

(P^)=^(P))=^(P), 

since 0* is the identity map of T(R). On the other hand 

/ (j)(x) dP(x) = [ xdP{x), 
Jx Jx 

since (j)(x) = x; this is the standard "integral" expression for expectation E(P) for a proba- 
bility distribution P on R. 

The map %x '■ T(X) —5- (X ftl R) (tly R is not necessarily monic; in the case of classical 
Schwartz distributions, it is monic, which allows the classical theory to identify extensive 
quanties in T(X) with elements in (X ftl R) rtlr R, and so one avoids having to mention 
T(X) explicitly; the notion of extensive quantity on X is thus made dependent on the notion 
of intensive quantity (test function) on X. It is, however, easy to give examples of Ts 
where there are not sufficiently many "test functions" X — » R (with R = T(l)) to make 
%x '■ T(X) — > (X rh R) &\t R injective, whence one motivation for the study of T, independent 
of the introduction of X rh R. 
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The injectivity of x x may be expressed: "To test equality of P and Q in T(X), it suffices 
to test, for arbitrary functions : X — > R, whether (P,<j>) = (Q,(j>)", whence the name test 
function. 

If %x is not injective, there are not sufficiently many such test functions X — > R, but 
there are enough, if we allow test functions with arbitrary T-algebras B = (B,fi) as their 
codomain. We have in fact 

Proposition 12 For any X G S, there exists a T -algebra B so that T X : X — > (X rh B) rhr B 
is monic. 

Proof. Take B = T(X); then the map e : (X rh T(X)) rhr T{X) -> r(X) given by "evaluation 
at Tj x : X -> is left inverse for t x . For, if P 6 then to say e(x x (P)) = P is 

equivalent to saying 

{P,T lx )=P- (17) 
Since the structure map of the T-algebra T(X) is jx x , the definition (01 of the pairing (P, T] x ) 
gives (P,T]x) — Hx{f]*(P)) (where 77 here is short for r\ x ). However, 77* is just another 
notation for T(rj), and fi x o T(r) x ) is the identity map on T(X) by one of the monad laws. 
So we get P back when we apply this map to P. 

Thus, instead of identifying an extensive quantity on X by its action on arbitrary test 
functions <p : X — > R, we identify it by its action on one single test function, namely the 
function 7] x : X T (X) . 

I conjecture that T(X) is actually the end J Be ^r (X rh B) rhr B. 

Here is an important relationship between differentiation of extensive quantities on R, 
and of functions : R—> T(X); such functions can be differentiated, since T(X) is assumed 
to be KL as an P-module. (In the Schwartz theory, this relationship, with X = 1, serves as 
definition of derivative of distributions.) 

Proposition 13 For P e T(R) and <j> G R rh T(X), one has 

(P',<j>) = -{P^'). 

Proof. We are comparing two maps T(R) x (R rh T(X)) — > T(X), both of which are T-linear 
in the first variable. Therefore, it suffices to prove the equality for the case of P — 8 t ; in fact, 
by /?-bilinearity of the pairing, it suffices to prove that for any t G R and d G D, we have 

(d-{8 t )',4>) = -{S t ,d-V). 

The left hand side is (5, — 05* (5,), 0), and using bi-additivity of the pairing, this gives <f)(t) — 
((ce d )*)0)(f) = <j)(t)-<j){t + d), whichis -d-ij>'(t). 

Proposition [9] can be seen as a special case, with X = 1 (thus T(X) = R), and with (j) 
the identity function R — > R. We use the "integral" notation. Thus x denotes the identity 
function on R. So 

E(P') = ( xdP'(x) = - f {x)',dP{x) = - ( 1 dP(x), 
Jr Jr Jr 
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the middle equality by Proposition[T3l This, however, is — tot(P), by Proposition|2] 

The relationship between differentiation of extensive and intensive quantities on R ex- 
pressed in Proposition[l3]may be given a more "compact" formulation, using (fTTI i. For then 
we have, for P G T(R), that 

F ! = (p',r }R ) = -(p,n' R ). 

Thus in particular, knowledge of 77^ gives knowledge of P' for any P G T(R). It also gives 
knowledge of (j)' for any (j) : R — > V, with V a T-algebra which is KL module. For, any such 
<p extends over rjR to a (unique) T-linear F : T(R) — > V, so (j) — F o t]r; therefore 

f = (Foi lR y = FoT 1 > R , 

using ( IT6T >. 

Let us calculate ri' R : R — >• explicitly; we have for any d £ D and x £ R (writing 77 
for ?7 fl ) 

d ■ tj'(jc) = 77 (x+d) - tj (x) = S x+d -S x = -d-(S x )', 
so cancelling the universally quantified d, we get for any iGS that 

r?'(x) = -(5 i y. 

The first differentiation refers to differentiation of functions, the second to differentiation 
of distributions; it is tempting to write the former with a Newton dot; then we get 7] (x) = 
"(5,)'. 

The following depends on the Leibniz rule for differentaiating a product of two func- 
tions; so this is not valiud under he general assumptions of this Section, but needs the further 
assumption of Example 2, namely thet D consists of d G R with d 2 = 0, as in synthetic dif- 
ferential geometry. We shall then use "test function" technique to prove 

Proposition 14 For any P ET(R) and § G R 1+1 R, 

(Ph^)) / =P'h(/)+Ph^ / . 

Proof. It suffices to prove that for the "universal" test function 77 = r\x ■ X — > T(X), we 
have 

({Ph^)' ! 7 7 } = {P'h ( /.,77) + (Ph0',77). 

We calculate: 

((Ph </>)', 77) = -(Ph 0,77') (by PropositionlHll 
= - (P, <j) H 77'} (by Proposition^ 
= -(P,(0H77)'-f H77) 

using that Leibniz rule applies to any bilinear pairing, like H, 

= -(P,((/)H77)') + (P,(/)'H77) 
= (P / ,(/)H77> + (P^'H77> 
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using Proposition[l3]on the first summand 

= {P'\-(j>, n ) + (P\-(j>',r]) 
using Proposition |3] on each summand 

= (P' h <j)+P\- f ,77} 

In other words, (replacing H by •), the proof looks formally like the one from books on 
distribution theory, but does not depend on "sufficiently many test functions with values in 
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